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Abstract 

A class of measure-valued processes which model multilevel multitype populations under¬ 
going mutation, selection, genetic drift and spatial migration is considered. We investigate 
the qualitative behaviour of models with multilevel selection and the interaction between the 
different levels of selection. The basic tools in our analysis include the martingale problem 
formulation for measure-valued processes and a generalization of the function-valued and 
set-valued dual representations introduced in Dawson-Greven (2014). The dual is a powerful 
tool for the analysis of the ergodic behaviour of these processes and the study of evolutionary 
systems which model phenomena including altruism, the emergence of cooperation and more 
complex interactions. 
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1 Introduction 

Multitype populations are naturally modeled as measure-valued processes. In this paper we 
consider a class of multilevel measure-valued processes which model ensembles of subpopulations 
with mutation and selection at the subpopulation level and possible death and replacement of 
subpopulations. In particular this includes mathematical models of multilevel selection which has 
been the subject of considerable debate in the evolutionary biology literature. Before introducing 
our multilevel models we begin with a brief review of some of this literature. 


1.1 Hierarchical population structure 

The hierarchical structure of populations plays a fundamental role in the biological and social 
sciences. In evolutionary biology and ecology the hierarchy includes ecosystems, community, 
species, organism, genes and in the social sciences we have cites, regions, nations, etc. These 
are systems in which at each level of the hierarchy we have a collection of elements of the next 
lower level in the hierarchy. The description of a unit at a given level in the hierarchy involves 
the distribution of the different characteristics of the individuals at the next lower level in the 
hierarchy. 


1.2 Historical remarks on hierarchy in population genetics and evolutionary 
biology 

Biological evolution can be viewed in terms of a hierarchy of levels of organisation going from 
the molecular level to the species level and social of groups of members of a species. A natural 
question is to what extent does the Darwinian mechanism of variation and selection of htter 
types in a competitive environment play a role at the various levels. 

An early application of group selection was by Wynne-Edwards (1962) |^] who used it to 
explain adaptations and social behaviour of animals. Subsequently G.C. Williams (1966) [^] 
made a highly critical analysis of group selection which was very influential. John Maynard 
Smith (1964),(1976) (|Q,|Q) considered both group selection and kin selection which was 
introduced by W.D. Hamilton (1964) and concluded that there may be conditions under 
which group selection is effective. In the subsequent decades there has been intense debate 
among evolutionary biologists about the extent to which evolution has been shaped by selective 
pressures acting at the level of groups of individuals. 


In recent years the role of multilevel selection has re-emerged in a number of contexts includ¬ 
ing the emergence of life (Szathmary and Demeter |^|), structural complexity (Gornerup and 
Crutchfield (2008) |Q), prebiotic evolution (Hogeweg and Takeuchi |^), plasmid replication 
in bacteria (Paulsson (2002) |^^), evolution of cooperation (Traulsen and Nowak [^]) and so¬ 
ciobiology (Wilson and Wilson) [p^]. In the study of cultural evolution Boyd and Richardson 
1 ^ suggest that interdemic group selection can be important when there are multiple stable 
equilibria at the deme level and the emergence of higher level equilibria occurs. Moreover these 
ideas are relevant in the context of spatially structured populations and evolutionary ecology 
(see Lion and van Baalen |H], Lion et al ^^). A detailed study of host-pathogen systems 
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in the framework of multilevel selection was carried out by Luo (2013) [^] and Luo and 

Mattingly [^] who demonstrated that a phase transition between the dominance of selection 
at the different levels can occur as the model parameters are varied. Multilevel selection also 
underlies current research in the development of complex human societies (see e.g. Turchin et al. 
[^5| ). Several books have been written on the question of the levels of selection. These include 
Brandon and Burian |11|, Sober and Wilson |76], Keller ||3^ and Okasha An number of 
other recent research papers on multilevel selection are included in the References. 


We end with a quotation of Leigh that provides a useful perspective on these questions: 
“These conditions ( e.g. he quotes Kimura’s conditions - see !&) seem so wonderfully im¬ 
probable that, following Williams (1966), most biologists have focused almost exclusively on in¬ 
dividual selection. Improbability, however, does not mean impossibility. Group selection capable 
of overwhelming seleetion within groups, played a crucial role in some major transitions ...”. 


An objective of this research is to develop tools to identify conditions under which higher level 
selection is relevant for the class of mathematical models we consider. 


1.2.1 A multideme model with two types of individuals 

In order to introduce the main ideas we briefly review a formulation of group selection given by 
Aoki 1^]). This begins with a countable collection of demes where each deme is a population of 
n individuals which are either type A or type B. Type B individuals are altruistic and add to 
the fitness of the deme. The life cycle of a deme involves four discrete events, namely, migration, 
reproduction, extinction and recolonization. In the reproduction stage, within each deme the 
population undergoes weighted finite population resampling in which type B has fitness —s 
(with s > 0). The probability that a deme suffers extinction is a monotone decreasing function 
of the proportion of type B individuals it contains, that is, the fitness of the deme increases as 
a function of the number of altruistic individuals it contains. In the migration stage a random 
number of individuals within a deme are replaced by individuals chosen at random from the total 
population pool. Aoki then obtained a recursion formula for the probability distribution of the 
number of individuals of type B per deme over successive life cycles and discussed the question 
of the long time behavior of this distribution, in particular whether or not the proportion of 
type B goes to zero or not. 

1.2.2 A diffusion process model of multilevel selection with two types 

The class of Wright-Fisher diffusion processes plays an important role in population genetics. 
Following AokiJ0), an analogous extension of the Wright-Fisher process was introduced by 
Kimura (1983) with alleles A and B distributed in an infinite number of competing demes. 
It is assumed that B is the altruistic allele which has a selective disadvantage si but which is 
beneficial for a deme in competition with other demes, namely, a deme having frequency x for 
B has advantage S 2 {x — f yv{dy)) where v{dy) is the distribution of the frequency of type B 
individuals over the set of demes. This leads to the integro-differential equation for the dynamics 
of the density {i'(t,x)}t>o where iy{t,dx) = i'(t,x)dx : 
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di>{t, x) 7 i 


2 dx"^ 


{x{l - x)v{t, x)) - x)i^{t, x)) + S 2 - y yv{t, y)dy^ v{t, x) (1.1) 


dt 

where 


and mi 2 , rn 2 i are the mutation rates 1 ^ 2 , 2 —>■ 1 respectively, c is the rate of migration between 
colonies and the resampling rate 71 is inversely proportional to the effective population size at 


M(t, x) = m 2 i(l — x) — mi 2 X + c ( / yi'it, y)dy — x j — six(l — x), 


a deme. This model will be discussed in detail in subsection 4.2.1 including Kimura’s analysis 


which was based methods of ordinary differential equations as well as the analysis using the dual 
representation which will be developed in section 3. The duality method is not restricted to two- 
type systems (as is the case for the ode method) and can be used to study general multitype 
systems. 

1.3 Multilevel multitype measure-valued processes 

The natural framework for multilevel multitype models with random effects at different levels is 
the setting of multilevel measure-valued processes. Models of this type were first developed for 
multilevel branching systems in Dawson and Hochberg 11 and Etheridge The long-time 
behaviour of multilevel measure-valued processes is investigated in Wu Dawson and Wu 


[^1, and Gorostiza, Hochberg and Waklbinger [^|. In particular two-level measure-valued pro¬ 
cesses have state spaces of the form A4(A4(E)) for some Polish space E where Ni{E) denotes the 
space of Borel measures on E. In this paper we work with an analogous class of two level prob¬ 
ability measure-valued processes formulated in terms of a well-posed martingale problem which 
generalizes the Kimura model to systems with more than two types, more complex interactions 
and to the diffusion limit of systems with finitely many demes. 


1.4 Outline of the paper 

The main objectives of this paper are to formulate a general measure-valued framework for 
multilevel population systems with mutation and selection and to develop the method of duality 
for multilevel measure-valued stochastic processes with applications to population systems with 
multilevel selection. In section 2 we introduce the class of models characterized as solutions to 
a well-posed martingale problem. In Section 3 we introduce the dual processes used to establish 
that the martingale problems are well-posed and to compute joint moments. These are given 
by the multilevel generalization of the class of function-valued and set-valued dual processes 
introduced in Dawson and Greven In Section 4 we consider the long-time behavior of 

systems with two types and with two levels of selection. In Section 5 we introduce some more 
complex models of systems with K > 2 types and with multilevel selection as well as further 
possible extensions of these models and methods. 
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2 Multitype-multilevel mutation-selection models 

2.1 A two level finite population model 

We begin with a two-level finite population model given by an exchangeably interacting system 
of Moran particle systems with selection at both levels. We assume that the higher level fitness 
of a subpopulation can depend on the distribution of level I types within the subpopulation. In 
other words at the group level the fitness V 2 (//) of a subpopulation described by its distribution 
/j over the space of types, is a result of a network of interactions. Then the resulting distribution 
of the collection of subpopulations {/Uj}iGS is formulated in the setting of multilevel measure¬ 
valued processes which provide a natural setting for the study of hierarchical systems of this 
type. 

We begin with a simple colony containing W individuals described by a Moran model. Each 
individual has type inl = {l,...,it'}. We let 

Uk := number of individuals of type /c G {1 ,..., A} 

K 

Ni = y^^nk 

k=l 

and we think of the normalized vector ^(ni,..., ni^) as an element of V{T), the space of 
probability measures on {1 ,... , A}, 


( 2 . 2 ) 

(2.3) 


A = 


1 ^ 


Ni 


k=l 


where the single atom measure 5k represents an individual of type k. 


The dynamics of a simple colony is given by a continuous time Markov chain, {Xt '■ t > 0} 
with state space V{T). 

The dynamics includes: 

• Mutation: given by transition rates {mij}ij£i, that is, the rate at which an individual of 
type i is replaced by an individual of type j 

• Sampling: at rate ^ an individual of type i is replaced by an individual of type j where 
j is chosen from the empirical distribution X 

• Selection with fitness function Vi : I —)■ [0,1] and intensity si. 


The resulting transitions for the probability-measure-valued process are given by 




1 . 1 e 

—di -I- —dj at rate mijix[i) 

]N\ l\l 

at rate ((W - l)y -h 


(2.4) 


(2.5) 
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for i,j € I. Note that these assumptions result in a rate of change dX{t,i)/dt due to mutation 
and selection of order 1 /Ni which turns out to be the same as the order of the sampling fluctu¬ 
ations when 7 i > 0. This corresponds to the case of weak selection in population genetics and 
the diffusion limit below will involve a time speed-up by a factor of Ni. In population genetics 
the parameter 71 is viewed as inverse effective population size (see Remark 5.4 in |^|) and is 
a measure of the population size in relation to the selection intensity in the finite population 
model. 

We now consider a collection of N 2 colonies (demes) each consisting of Ni individuals with 
internal dynamics within each colony given as above. In addition (as in the Aoki model) there 
is an interaction between colonies via migration. To model this, individuals within a colony die 
and are replaced at rate c > 0 by a new individual with type given by that of a randomly chosen 
individual in a randomly chosen colony. 

The final mechanism is death (extinction) and replacement of colonies following the same 
mechanism as the sampling mechanism within colonies. That is, a colony dies and is replaced 
by a copy of a randomly chosen colony. In addition we can include deme-level selection using 
a level II fitness function V 2 {ff) and selection intensity S 2 - For example, we can take a linear 
htness function of the form 

1^2(fi) = J V2ix)n{dx) 

We then consider the empirical measure 


^ i=l 


N2 


where fii denotes the state of the ith colony, namely, 


1 ^ 

1 n-ij 


i=i 


and Ujdenotes the number of type j individuals in the ith colony. 

The resulting transitions due to the level one dynamics are of the form 

at rate mijn{i) ((iVi - l)'^ + siVi{j)) n{i)n{j)n{dff 

The resulting transitions due to level II sampling and selection for the measure-valued process 
are given by: 

iz ^ {n-\- + ^^ 2 )) i^ate {s 2 V 2 {n 2 ) + y (^2 - 1 )) v{dpLi)v{dpL 2 ) 

We can then consider the limiting behaviour as A^i or N 2 go to 00 , or we can allow them to go 
to infinity simultaneously with for example N 2 = 'r]Ni. In the following subsections we will first 
consider the limit as iVi —>■ 00 for a finite system of N 2 demes leading to a system of interacting 
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Fisher-Wright diffusions. We then consider the exchangeable system of Fisher-Wright diffusions 
with additional death and random replacement of demes as described above and then obtain a 
two level measure-valued diffusion process (called the two level Fleming-Viot model) by letting 
N 2 —)■ 00 . 


2.2 The martingale problem formulation 

The framework in which we specify the different stochastic models is the class of probability¬ 
valued Markov processes X{t) E P{Ei) where Ei is a Polish space. Let De^ ([0, 00 )), {Ce^ ([0, 00 ))) 
denote the class of cadlag (resp. continuous) functions from [0, 00 ) to Ei. We denote by {Ft]t>o 
the natural filtration of u-algebras on these spaces. The probability law P E V{De^{[0,oo))) is 
said to be a solution of the martingale problem with generator {G, D{G)), where G is a linear 
operator on D{G) C C{Ei) and D{G) is measure-determining on Ei, if 

ME{t) := F{X{t)) - f GF{X{s))ds 

Jo 

is an Jj-adapted P martingale for all F E D{G). 

The martingale problem method is used to characterize stochastic processes of interest in many 
applications. The method (which we will also use below) consists of four steps: 

(1) to construct a sequence of approximating processes with probability laws Pn E V{Dei{[0, 00 ))) 
that satisfy some simple martingale problems, 

(2) to show that the laws of the processes are tight, that is, relatively compact in V{De^ ([0, 00 ))) 

(3) to show limit points of the Pn satisfy the martingale problem defined by {G,D{G)), and 

(4) to prove that there is a unique solution to this martingale problem thus characterizing the 
limiting probability law P of the process of interest. We will use this method to define the 
Fleming-Viot process that models selection at two levels. 

A key tool used to establish the uniqueness of solutions is duality. This is achieved by con¬ 
structing a dual process Qt with state space E 2 and function F : V{Ei) x E 2 ^ such that 
the functions {F{-,g),g E E 2 } are in D{G) and are measure-determining on Ei, and the duality 
relation: 


Exio){F{X{t),go))=Eg,{F{X{Q),gt)) 


is satisfied for all f/o € E 2 for all V(0) E V{Ei) (where the right side denotes the expectation 
with respect to the law of the process). The class of set-valued dual processes we use in the 
study of multilevel mutation selection systems is developed in detail in Section 3. In addition 
to using the dual to establish that the martingale problem has a unique solution it will be used 
below to compute moments, to obtain fixation probabilities and to prove ergodicity. 


For general background on the martingale problem formulation see 


m, 0 and 


2.3 Diffusion process limits 

In this subsection we will identify the limit of the process {H 7 V 2 ,Ari (t)}t>o as Vi —>■ 00 for fixed 
N 2 < 00 . 
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2.3.1 The limiting single deme diffusion process 

We first consider the special case in which N 2 = 1 and let Ni —>■ cx). 

Proposition 2.1 The limit as Ni ^ 00 of the single deme (N 2 = I) normalized empirical 
measure with diffusion scaling leads (with time speed-up t —>■ Nit)) to a K-type Fleming-Viot 
process (equivalently, a finite type Wright-Fisher diffusion) which is characterized as the unique 
solution of the martingale problem given with generator 


K l K 

G'°/(x) = ^ I 

i=l \j = l 

K / K 


5/(x) 

maXi—maXi] \ —— - mutation 


dxi 


( 2 . 6 ) 


+ Sl Vi{k)xk\ 


9/(x) 


+Y 


i=l 

K 


k=l ) 

d‘^f{x) 






dxi 
genetic drift 


selection 


defined on the class D{G^) given by the class of functions f with continuous second derivatives 
on the simplex Nk-i = {{xi, ..., xk), > 0, — !}• 


Proof See for example 


Theorem 2.7.1 for the neutral case and 


Theorem 10.2.1 for the 


proof of uniqueness for the case with selection. (Also see |^], Chapter 10, Theorem 1.1 for the 
derivation of the diffusion limit starting with a discrete generation model.) □ 


2.4 Exchangeable system of Wright-Fisher diffusions 

We now consider a system of demes labeled by 5 = {1,2,..., N 2 } where the population at each 
deme undergoes mutation and selection as in the single deme process but in addition individuals 
can migrate between demes at rate c and the population in a deme can become extinct at 
rate S 2 and be replaced with population /i sampled from the empirical distribution of deme 
compositions. With selection, the replacement deme type is chosen with weights proportional 
to the level II fitness 

0 < V2{fi) <1, /i G P(I). 


2.4.1 Deme level fitness functions 

In order to incorporate fitness at the deme level we must introduce an appropriate class of fitness 
functions. It is natural to assume that the fitness of a deme (subpopulation) is a function of 
the distribution of level I types within the deme given by V2{h) when the distribution of types 
within the deme is fi G V{I). We also assume that V 2 is a bounded and continuous function of fi 
(in the topology of weak convergence). Without loss of generality (by the addition of a constant 
if needed) we can assume that V 2 {fi) > 0. 
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Example 1 Consider the speeial case I = {1,2}, and 

E2(/^)) = /(M1))>0. (2.7) 

Then (see e.g. Lorentz (1963) Ml (Chapt. 1, Theorem 4)), we can uniformly approximate V 2 
using Bernstein polynomials as follows 

V 2 {ix) = lim (2.8) 

k,i 

where the coefficients an,k > 0. 


In general, given a compact Polish space E we consider the space V{E) of probability measures 
on E with the topology of weak convergence. We then consider the Bernstein operators : 
C{V{E)) —>■ C{P{E)) (where C{V{E)) is a normed space with the supremum norm) dehned by 


B^fiL) = 


I I'iip-) 


fi{dxi)... fj,{dxK) 


(2.9) 

Then by (Dawson and Gartner ||^ Theorem 3.9 ) for any / G C{V{E)) 

B^f ^ / in C{V{E)). (2.10) 

This means that we can approximate any bounded continuous htness function V 2 € (7+(7^(1)) 
by 

B^V 2 {p,) = J ... J h{xi,... ,XK)L{dxi)... nidxK) (2.11) 

where h is a bounded non-negative function on (I)^. This can be rewritten in the form 

. K 


B^V2{g) = Y.s, 


i,K 


i=i 


( 2 . 12 ) 


where for each i the AK,i,j are subsets of I. We denote the class of htness functions of the form 
(2 .12| ) by Vk and note that we can approximate any bounded continuous htness function by a 
functions in V := UkVk- 


Example 2 Types I = (1, 2}. If //(I) = pi, /r(2) = p 2 , then 

y2{L)=PiP2, i'i-- V2{fJ.)) = pI-\-pI-\- piP2 (2.13) 


P2 (m) = T^iC), 1 - V2{t2) = (2.14) 

where C = {1} 0 {2}. 

Example 3 Consider the 3 type case I = {1,2,3} with fitness functions as follows: 


b^i(l) = si, V2{p) = S2p{2)n{T). 
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Example 4 Model with 3 types I = {1,2,3} and mutualistic (state-dependent) fitness. 

• Vi(l,/x) = si/i(2), Vi{2,p.) = sip.{l), El(3) = 1/2 

• Level II fitness is V 2 {p.) = S2[^Ai(3) + 2si//(l)/i(2)]. 

This can be analysed using the set-valued dual as indicated in Remark |^. 

Example 5 V 2 {p) is positive iff the population contains a certain set of properties (from a finite 
set). 


V2ifi) = (2-15) 

l-E2(/,) = ^e,/i®(yl{) (2.16) 

where > 0, Y1,D = I, C (I)^. 

2.4.2 The limiting generator as A^i —> cxd and N 2 < oo 

The generator for the resulting model of N 2 interacting demes: for F G C'^('P(I)^^), with 
X:= (xi,...,xiv,)€(^(I))'^2 


N 2 

= p^GlF{X) 

?=i 


(2.17) 


N 2 




C=i 


K N 2 




mutation-selection dynamics at each site 
dF{X) 


j=i \^'=i 


dxffff) 


migration 


N 2 / N 2 \ 

+S2 E wE V 2 {x{f^))F{^^^rX) — E(X)] I deme replacement 
?=i C'=i / 


N 2 N 2 


+ ^72 EE [E(<l>^^/X) — E(X)] deme resampling 


«=U'=i 


where <l>gg/X = (xi,... ,x^,... (corresponding to the replacement of x^ by x^) and r] 

is a parameter that depends on the relation between the natural time scales at the two levels. 

The martingale problem with generator (7^2,mt ^ unique solution that defines a cadlag 

strong Markov process {xf^^}i>o with state space (7^(1))^^. The proof follows as in the proof 
of Proposition 2.1 but where the dual process needed to show that the martingale problem is 
well posed in given in Subsection 3.1. 
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2.5 Empirical measure-valued processes and the Fleming-Viot limit 

We will next consider the limit as N 2 —>■ 00 in the general case in which we can have S2 > 0 
and/or 72 > 0. We assume that the initial state satisfies (/xi(0),... , fJ-N2{0)) is exchangeable. 

Lemma 2.2 Consider the Markov process X(t) = (xi(t),... ,xjv2(t)) € with generator 

QN 2 ,int^ Assume that the probability distribution o/X(0) is exchangeable (i.e the distribution is 
invariant under permutations of {1,2,... ,N 2 }). Then (xi(f),...,x^r^(t)) is an exchangeable 
system of V{T)-valued diffusions. 


Proof This follows since the migration and level II selection terms in the generator are invariant 
under permutation - see for the general case of exchangeable diffusions. 

□ 


The level II empirical process is defined by 


1 ^'•2 

^ WI))- (2.18) 

^ i=i 


Then by Lemma 

the interacting system. To describe this we consider the algebra of functions, D{G^^), on 
V(V{T)) containing functions of the form 


2.2, 'B^^{t) is a P(P(I))-valued Markov process with generator inherited from 


where 


K . 

= / 

k=l L"' 


hk{hk)v{dp.k) 


(2.19) 


lAf,ij) that is, a polynomial on P(I). (2.20) 

j * 

We then define the generator in terms of the generator of the interacting system as follows: 




where H € D{G^^) and 


N 2 


V = 


N 2 




i=i 


( 2 . 21 ) 


and ( 7 ^ 2 ,mt -g g^ygj^ (|2.17 ). 
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Theorem 1 {^1, 

Assume that 72 > 0 and V 2 € V. Then 

(‘='t)tg[o,T] as A'"2 —>■ 00 

where Et{dx) € C''P( 7 >(i))([ 0 , T]) is the two level Fleming-Viot process with level two selection 
given by the unique solution, {Pj^ : n G V{V(I))}, to the well-posed (^ 2 ,^ 2 ) martingale problem 
where the domain D 2 C C'(P(P(I))) consists of the algebra of functions containing functions of 
the form ( \2.1b ) and the generator acting on D 2 is given by 


( 2 . 22 ) 




G2H{u) = VG' , . 

Jp{I) 


^{dpi) 


v{dpL2)pi2{dx) — pLl{dx) 


^{dfii 


+ 


Jv{i)Ji \dhi{x) 

[ T-J^ ^ 2 ( 1 ^ 1 ) - [ V2in2)i^idn2) 
J-p(l) Jpd) 


where is given by (\2. (\). 


iy{dfii 


Proof We follow the standard argument which involves three steps: proof of the tightness of 
the laws of the processes, proof of convergence of the generators on a sufficiently large class of 
functions and finally proof that the martingale problem associated with the limiting generator 


has a unique solution. The first two steps follow in the usual way (e.g. proof of |^|, Theorem 
5.3.1). It then remains to prove the uniqueness - this will be proved in the next section after 
introducing the appropriate class of dual processes. 

□ 


Remark 1 An alternative class of functions, D 2 , is the linear span of functions of the form 



Xk,nk)hk{dXk) vidpLk) 


(2.23) 


We also consider the convex set V 2 of [0, l]-valued functions which contain functions of the 
above form with h having values in [0,1]. Note that this class uniquely determines probability 
measures on V{V{T)). 
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Remark 2 The multilevel Fleming- Viot proeess is the analogue of the multilevel superprocess 
- see e.g. Dawson-Hochberg (1991) J7^/ , Etheridge (1993) Wu (1994) Gorostiza- 
Hochberg-Wakolbinger (1995) Dawson-Hochberg-Vinogradov (1996) Dawson and Wu 
(1996) Dawson-Gorostiza-Wakolbinger (2004) /jH/- 


Remark 3 In the special case S 2 = 0 and 72 = 0 we obtain the mean-field limit. We consider 
a tagged colony - by exchangeability this can be colony 1, fii^{t). Then as N 2 ^ 00 in the limit 
we obtain the measure-valued McKean-Vlasov dynamics (cf. f^) given by the solution to the 
martingale problem with nonlinear generator 

Gi^iT) = G^F{g) + c [ [ iz{dfi){n{dx)) - fj.{dx)] (2.24) 

and the law of the process, € V((P{I)) is the weak solution of a nonlinear second 

order partial differential equation. 

If we assume 72 = 0 but S 2 > 0, then Et is still deterministic and is the solution of a nonlinear 
second order partial differential equation which is a generalization of Kimura’s equation O- 
Depending on the functions Vi,V 2 and with recombination these nonlinear equations can exhibit 
a range of behaviors including multiple equilibria and possible periodic or chaotic behaviour (see 
Akin l^). In the general case with 72 > 0 we obtain a two level Fleming- Viot process. 


3 Duality for interacting and two-level Fleming-Viot systems 


In this section we introduce a basic tool, namely the generalization of the class of set-valued dual 
introduced in \M\ to the class of two level probability-measure-valued processes E{t) G V{V{T)) 
which were obtained in the previous section. These processes satisfy the martingale problem 
with generator G 2 . 


Mnit) := HiEt) - / G2H{Es)ds (3.25) 

Jo 

is a P — martingale for all H G ^ 2 (^ 2 )- 

The dual process developed here will be used to prove that there is a unique law 
P G P(C'-p(p(][)) ([0, 00 ))) which satisfies the martingale problem ( 3.25 ). As explained above the 
idea is to find a dual process and to establish the duality relation 




'H(0)(P(H(t),e2)) = Eg2iFiEi0)),g^)). 


(3.26) 


We begin by obtaining the dual for the system of interacting P(I)-valued processes with 
generator ( 7 ^ 2 ,given by ( p.l7 ). For detailed background on the duality method to be used 
refer to @ Chapter 5. 


In 
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3.1 A fund ion-valued dual 

We now introduce a function-valued dual for the process with generator state space 

for the function-valued dual is the set of functions, H, of the form 

Ei nS n"i. By inspection of the action of the generator ( 7 ^ 2 ,functions in 

H, we can read off the corresponding function-valued transitions corresponding to mutation, 
selection and migration as follows. 

• Level I Selection with Vi{x) = ls(x). 

Transitions at rate si 


h{xi,... ,Xn) ^ lB{xi)h{xi, ... ,Xn) + + l)h{xi, ...,Xn) (3.27) 


• Mutation 



(3.28) 


• Level I Coalescence: At rate 


/i(xi, . . . , Xn) y h(Xj, . . . , Xj, . . . , Xj — l, Xj, Xj-|_l, . . . , Xn) 

• Migration: For each i,j G S, at rate , 

hl{Xii,Xi2)h2{Xji,Xj2) hi{Xii,Xj3)h2{Xji,Xj2) 


(3.29) 


(3.30) 


Here the first index indicates the deme and the second the rank at the given deme. 


• Level II selection: By (2.12|) and taking convex combinations it suffices to consider a level 


II fitness function of the form: 


V2{^^) = KB) 


h{-)^v2m-) + {i-v2{-))(^h{-) 


(3.31) 


Then the level II selection: for each i,j G S' in transitions at rate 


h{xil,Xi2) lB{xil)h{xi2,Xi3) -k (1 - lB{xjl))h{xil,Xi2) 


• Level II coalescence: for each pair i,j at rate ^ 


hl{Xii,Xi2)h2{Xji,Xj2) hi{Xii, Xi2)h2{Xi3, Xii). 


(3.32) 






3 DUALITY FOR INTERACTING AND TWO-LEVEL ELEMING-VIOT SYSTEMS 17 


3.2 A set-valued dual for exchangeably interacting systems of Fleming-Viot 
processes 


We now introduce the set-valued dual which will be used to study the interacting system of 
Fleming-Viot processes and then the limiting two-level Fleming-Viot process. This is based on 
the set-valued dual introduced in |^| (subsections 9.4, 9.5) for the system of exchangeably inter¬ 
acting Fleming-Viot processes but extended in order to include level II selection and resampling. 


We begin with the population at a set of demes labeled by S with 


S = {1,...,N2} 


(3.33) 


with migration between demes as defined in subsubsection |2.3.l| with the assumption of ex¬ 
changeability. Recall that the state space for the finite system of interacting Fleming-Viot 
processes is (R(][))'^.The set-valued dual is a rehnement of the function-valued dual sketched 
above. Noting that it suffices to work with linear combinations of indicator function the Level I 
function-valued and set-valued version of the above dual were introduced and studied in depth 


in Dawson and Greven |^]. 

We now introduce the state space and notation needed to define the set-valued dual Qt- 
Recall that I = {!,...,iF}. We indicate the indicator function of a subset ^ C I by = 
(ei,..., ex) with e* = 1 if i G ^4 and e* = 0 is 1 G A^, that is, the complement of A. For example, 
the indicator function of {1,2} C {1,2,3} is indicated by (110). We sometimes identify finite 
subsets with their indicator functions. 

Let 


Z := algebra of subsets of 

of the form A (gii I^, A is a subset of I™', m G N, 

the coordinates in a product in I"* are called ranks. Given A,B C I we denote the product of 
these sets in I x I as A (Di R. Given A,B dX we denote the product of these sets in X x X as 
A (8)2 B. 


The state space: for the set-valued dual associated to the interacting systems of Fleming-Viot 

processes with S = {1,..., N 2 } is the algebra of sets containing sets of the form 


(g) , Aij el, me N, (3.34) 

2,ieS 

e (X)®2^. 


In order to describe the dual dynamics we first describe the transitions that occur for a set 
written as a disjoint union of sets of the form ( 3.34 ) where in Aij C I the first subscript denotes 
the deme and the second subscript denotes the rank at the deme, and Vi{x) = 1b{x) with rate 


Si > 0. 
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The transitions of the set-valued process are obtained by restricting the function-valued 

transitions to indicator functions of sets in These are then given by: 

Level I selection at rank j* at deme G S' at rate s: 

.j* —^ Ri*.j* 11 .j*" 11 j* (3.35) 

and all other ranks larger than j* are also shifted to the right at deme i*. 

Mutation at rank j at deme i: (refer to Definition 5.12 and Subsubsection 9.5.3) 


Aij C I ^ Aij U {^} with £ G I at rate 

(3.36) 

kGA 


Aij -A Aij\{L} at rate ^ mi^k- 

(3.37) 




Coalescence at rate 71/2 of ranks ji and j2 > ji at deme i £ S: Aij^(S'iAij 2 — > = ^iJiOAij^ 

and Aij = Ajj+i for j > j2. 


Migration at rate ^ of rank j from deme ^2 G S to zi € S. Let Ai = 


d .7 02 Ai2 — y ( 8*2 Ai2 

(3.38) 

^h,n-i_+l = Ai2^j 

(3.39) 

^12,^ = ^12,(.+ 1 ^ ^ i 

(3.40) 

-4*2 ,£ = -4*2 for i < j 

(3.41) 


Remark 4 Note that in the limit N 2 —>■ 00 the measure v is nonatomic and migration or level 
II selection transitions always lead to a new (that is unoccupied) deme. 


Coupling: Note that every set in can be written as the union of a finite number of disjoint 
sets of the form ® 2 i=i j=i -^i,j with Aij C I and N £ N. Finally the above transitions 
are simultaneously carried out in this disjoint union of products and are coupled as follows: all 
selection, mutation, coalescence and migration operations are simultaneously applied to each 
rank at each deme of each product in the disjoint union. Each such transition preserves the 
decomposition of the disjoint union into a new disjoint union - this is obviously satisfied for 
mutation, coalescence and migration and true for selection in view of the specihc form (|3.35|). 
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3.2.1 The dual representation 

We now state the duality relation between the system of interacting Fleming-Viot processes X 
under the assumption 

X(0) = ^ith/i € P(I) (3.42) 

which implies that we have a system of exchangeably interacting Fleming-Viot processes. 
Define the function F : 7^(1) (g) I —>■ [0,1] by 

F(X,g)=N*{g) (3.43) 

where if X(0) = then X*(0) = ^ For example, if G = 

(S)iG5 with Gi € i, then 

= (3.44) 

165 


Theorem 2 Let X'^^ denote a solution to the martingale problem with generator _ Then 

(a) Dual Representation 


^x(o)(i"(X: 


N2 


^N2,int 
) ^0 


)) — E N2,int{F(N.^ 
Vo 


N2 
0 > 


->N 2 ,int 

H 


)) 


(3.45) 


(b) The representation ( 3.4^^ ) uniquely determines the marginal distribution of the process 
and therefore establishes the uniqueness of the solution to the martingale problem. 


Proof The proof in the case S2 = 0 is given in detail in ||24|| based on verifying that the 
generators of the two processes acting on the function F satisfy the relation 

GN 2 MtF(^^ A) = A) for all G P(I), Ag\, (3.46) 

where is the generator of the set-valued Markov jump process with transition rates given 

above. The extension to the case S2 > 0 follows in the same way and will be given in more detail 
for the two-level process below. □ 


Remark 5 State-dependent fitness 

We can also consider level I selection that is state dependent, that is in which the fitness of a 
type depends on the distribution of types (e.g. diploid). For example we could have the fitness of 
type 1 proportional to the population proportion of type B, that is, = sfi{B), B Cl, s > 

0. In this case the dual has function-valued transitions at rate s: 

/ ^ Is (2)1 [Ilf - li 1^1 /] + / 

= Ib(^i [1i/ + (1-1i)0i/] 

+{1 - Is) /. 
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A second example to a set-valued dual which can he used to analyse such systems, eg. mutu- 
alistic types (see Example^. Consider 1 = {1,2,3}. 


Vi{l)=vi, Vi{2,iJ,) = VM ■ Vi{3,p) = vm ■ 


with transitions 


/ ^ li/+ (I2 + I3) <8)1 / at rate'll, ( 3 . 47 ) 

/ ^ I3 (8)1 [I2/ + (li + I3) /] + (li + I2) / at rate vm, 

/ ^ I2 01 [I3/ + (li + I2) 01 /] + (li + I3) 01 / at rate vm- 


3.3 A set-valued dual for the two level Fleming-Viot process 


The objective of this subsection is to extend the set-valued dual of subsection in order to 
construct set-valued duals , Gf for the V{V{T))- valued processes {E^^{t)}t>o (assuming 
exchangeable initial configuration) and {H(t)}j>o. We assnme that the level II selection rate is 
S 2 and with fitness function V 2 and the level II resampling rate is To simplify the notation 
we take 7 / = 1 in ( 2.17 ) in the subsequent discussion. 


The state space: P* for the set-valued dual Gt is is the algebra of subsets of (I^)^ containing 
sets of the form 


(g) ( 038=1 A,j) , Aij G I, ni € N, (3.48) 

2,2<m 

G (X)®2"^, m G N, 

where i is the index of the deme and j is the index of the rank within the deme. The transitions of 
the dnal due to level I mutation, resampling, and selection at each deme and migration between 
demes are given as above in snbsection |3.4 In the N 2 00 limit, migrants always move to a 
new (nnoccupied) deme, namely the the nnoccupied deme of lowest index. 

We assume that V 2 belongs to the class of level II fitness functions of the form 

F2(^) = j;S2.iF2y(^), V2,{pi)=il^{lB,) (3.49) 

j 

and Bj = nr=i with Bji C I. 

We now introdnce the additional transitions that occur due to Level II selection and coalescence. 
For the former, nsing the linearity it suffices to describe the contribution to the dynamics of V 2 ,j 
with V 2 j{p) = fi®{B), B €l. 

As above we use the notation 0i and 02 to distinguish such products on I and I. Similarly for 
measures in z/j G 7^(7^(1)) we write vi 02 1^2 for the product measure. 


Set-valued transitions - deme level selection 
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Using the linearity it suffices to consider the contribution to the dynamics of with 
^ 2 ,j(/^) = Bj G Given such a fitness function and sets Ai ^ X with the sub¬ 

script indicating the deme, then for every k,i S the action of selection on deme k results in 
the transition 


^ I (2)i A}^ 


AJ U (g) A, 


2 ,ieS 


2 , i^k 


occurs with rate ^ if 5 = {1,, A^ 2 }- 
If S' = N, then this becomes 


2 ,ies 


Ai I Bk <8)1 Ak 


U (g) A, 


2,i=l 


2 , i^k 


2,i=l 


(3.50) 


(3.51) 


where Ai,Bi € X, n is the number of occupied demes (i.e. not identically I^) and £ denotes the 
first unoccupied deme, occurs with rate S 2 j- 


Note that exchangeability is preserved by the dynamics and we can again couple the correspond¬ 
ing indices after a level II selection event, that is all further transitions are applied simultaneously 
to the corresponding indices (deme and rank at the deme). For this reason we can rewrite ( p.51 ) 
as 


n 


2 , 2=1 


Bk <8)1 Ak 02 (g) Ai 

2 , i^k 



(3.52) 


where Ai = Ai'il i < k, Ai = Aj+i if i > k. This means that the new event is a disjoint union 
of events and this is preserved by all further transitions. 


Example: Let I = {1,2} with fitness V 2 (/r) = /r(l). Then given Ig = li, then the first transition 
(in terms of indicator functions) 


li —>■ li 01 li + li 02 I 2 


so that letting 

E^(fx(l)) = J 

we have 

E,(/r(l)) ^ E,(^(l)) + E,((m(1))2) - (EMm^ = EuiTil)) + Var,(^(l)) 
so that if ly is not a single atom probability measure the level II selection is effective. 


Set-valued transitions - deme level coalescence 
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The level II resampling results in the coalescence of two demes, for example demes 1,2 as 
follows: 



where the exterior subscripts denote the deme. 

When there is no level II coalescence and t'o is non-random, the {t —is deterministic 
and it suffices to consider ko = 1. The reason is that there is no interaction between the 
supports of the associated set-valued processes starting with disjoint supports in this case so 
that Var(f = 0. 


3.3.1 The duality relation for the two level Fleming-Viot process and its applica¬ 
tions 

We now present the dual representation for the two level Fleming-Viot systems. 

Let {H(t)}t>o denote a (I))-valued process with probability law 

which satisfies the (G 2 , Ll 2 )-iiiai'tiiigale problem. Then the time-marginals H(t) are random 
probability measure on 7^(11). Then by de Finetti’s theorem (see HI, Theorem 11.2.1) there 
exist a sequence {Jin} of P(]l)-valued exchangeable random variables such that {Jli,... ,Jin) has 
joint distribution 

P^'^\t,djli,... ,djln) = f E{t,dni).. .E{t,dfin)dP'^'^, n G N, (3.53) 

7c.p(.p(i))[0,oo) 

that correspond to the moment measures of E{t). 

Let Gt with values in 1 ^* denote the set-valued process defined above. 

Define the function R : {P{V{T))) x (X)®^’* —^ [0,1] by 

Ni N2,k Ni 

n{u,g) = YU [xf”'''(^fc,i)] i^(d/ij) iff? = U® Ak^i, with Ak^i C (!)”'=% 

fe=l i=l k=l 2,1=1 

^2,ki ^‘2,k2 

with (^ Ak^,i n (^ Ak 2 ,i = 0 if A:i / k 2 , 

2,i=l 2,i=l 

we can also write this as 'H{v,G) = 
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Theorem 3 Dual Representation 

(a) For any solution {-Psq : Hq G V{V{I))} of the {G^^,D 2 ) or {G 2 , D 2 )-martingale problem 

E^,{niEt,Gl)) = Eg,{U{Eo,gl)) (3.54) 

(b) The {G^^,D 2 ) and {G 2 , D 2 )-martingale problems are well-posed. 

Proof The proof for the cases {G^^,D 2 ) and {G 2 ,D 2 ) follow the same lines. We now give the 
details the the latter case. 

(a) As above, we begin by identifying the terms in G 2 H{v) for functions in D 2 of the form: 


fco 


H{v)= n 


k=l 


/•p(i) iJr-k 


h{xk,i, ■ ■ ■,Xk,nk)dLk 


(g)nfc 


v{dp.k) , 


in other words we work with functions of the form 


fco 

X\h{xk,i,...,xk 

,nk)- 

k=l 


(3.55) 


The transitions for functions of this form are given as follows: 
• Level I resampling. This results in the coalescence 


h{xii,xi2)tiidxu)fi{dxi2) 
Migration. At rate c 


v{dp,) 


h{xii,xii)ii{dxii) 


k'{dfi), at rate 71 
(3.56) 


h{xii,xi2)t^idxii)p.{dxi2) 


u{dp) 


(3.57) 


h{xii,X22)hl{dxil)H2{dX22) 


v{dpLi)v{dpi2)- 


Selection at level I with 


Vi{x) = si1b{x). 


(3.58) 


This results in the transition with 


h{xii) fj,{dxii)i'{dfi) 
r 

h{xu)lB{xu)fi{dxii) + I I lBc{xu)h{xi2)T{dxii)iJ.{dxi2) 


(3.59) 


^{dfi) 


at rate si. 



3 DUALITY FOR INTERACTING AND TWO-LEVEL ELEMING-VIOT SYSTEMS 24 


Under level II coalescence two occupied denies i, j & S, i ^ j are chosen at random at rate 72/2 
and we have 

(3.60) 

h{xii,xi2)h{xji,xj2)niidxii)^i{dxi2)^ijidxji)fij{dxj2) v{dui)v{dnj) 
h{xii,Xi2)h{xi3, Xi4)fi{dxii)fi{dxi2)nidxi3)fi{dxi4) v{d^j), 
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him) + (1 - V^2(m)) ® /l(/U2). 


The corresponding set-valued transitions are obtained by restricting the class of functions h of 
the form h{^) = fi'^{A) with A € Z as in subsubsection |^ . 

Coupling. In view of the assumption of exchangeability, we can couple the V 2 (//)/i(/r) and 
(1 — V 2 {fi)) terms for the level II selection transitions at the deme level, that is, place these at 
the same deme index in the two resulting summands thus producing a union of disjoint sets in 
P*. Then as before, all operations are performed simultaneously on all demes and ranks in the 
different summands. 


Set-valued transitions The set-valued transitions can then be read off by restricting the function¬ 
valued transitions to the class of functions h that are based on indicator functions of sets as 
in ( [1.61 ) and noting that due to the coupling the transitions preserve the decomposition into 
the union of disjoint subsets. These transitions define a Markov jump process {G^}t>o with 
countable state space P* and we denote the resulting generator by The identity of the 

action of the corresponding terms of G 2 acting on and the result of the transition of the 

set-valued dual, that is. 


G 2 'H{u, G) = G) for all € V{V{1)), G € {if 


(3.67) 


is then immediate by inspection. For example, setting V 2 (/u) = ^®{®i^iBi) and applying 
the corresponding selection transition from G 2 to H{v) = / 0*=! with /ij(/ij) = 

/rj((8)ijMjj), ( 3.66 ) yields the set-valued transitions ( |3.51|) with Ai = iSiijAij which correspond 
the 


The duality relation 
Chapter 4 of p7|)). 

The uniqueness of the solution to the martingale problem is then obtained. In particular, mo¬ 
ment measures of the time marginals of any solution to the martingale problem are determined 
by the dual representation. In turn the moment measures uniquely define the T(H(t)) as fol¬ 
lows: Let ^ 1 , ^ 2 ) ■ ■ • be an exchangeable sequence of 'P(I)-valued random variables with marginal 
distributions given by the moment measures determined by the dual Gt and 


3.54) then follows from (|3.67|) (see for example Proposition 7.10 in |17] or 



i=l 


(3.68) 


Then by de Finetti’s theorem 

C{Em) C{E{t)) as m ^ 00, 


(3.69) 


that is, the time marginal laws of any solution are uniquely determined by this limit. 

(b) Since the class of function of the form R{-,G) with G £ is probability-measure- 

determining on V{V{I)) this implies that the time marginals of E{t) are uniquely determined 
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follows from (a). The result (b) then follows from the basic results on dual martingale problems 


(see e.g. Theorem 7.9 and Proposition 7.10 in |n] or Chapter 4 of p7[|). 


□ 


3.3.2 Moment calculations 

The dual can be used to compute joint moments and covariance structures. We illustrate with 
two simple examples. 

Example 6 Consider the case o/I = {1,2}, no mutation and Pi(l) = l,Pi(2) = 0, c = 72 = 
S 2 = 0 but si, 7 i > 0. In order to compute 



(3.70) 


we use the the dual started with Qq = (01). Then we have transitions due to selection and 
coalescence. As a re.sult 


= (01)®i’"W 


(3.71) 


where n{t) is a birth and death process with linear birth rate n —>■ n + 1 at rate sin and quadratic 
death rate 7in(n — l)/2. As a result {n(t)} is ergodic with distribution measure {pk : k € N}. 


Then 



Next consider 



(3.73) 


In this case we start the dual with Qq = (10) (8)1 (01). Again the number of ranks is given by a 
birth and death process which will eventually reduce to n{t) = 1 which due to coalescence implies 


that Q{t) = 0 for some finite random time r. This implies fixation, that is, the limit (3.75) 


is zero. The corresponding fixation probabilities are then given by the limiting first moments 
calculated in ([3. 7i ). 


Example 7 With two types {1,2} we take Qq = (lO)f^^ 02 (lO)f’^^ and we get 

Eeo = (10)f 1 02 (10)f ^)] = Eg2[H{Eo,g^)]. 



3.3.3 Coalescent 

The coalescent plays a central role in the study of Moran, Fisher-Wright and Fleming-Viot 
processes with neutral types. We now consider the analogous genealogical structure for two- 
level systems with si = S2 = 0 which is determined by the level one and level two coalescence 
transitions in the set-valued dual. The genealogy is described by a marked coalescent process 
analogous to the marked coalescent process used for spatial processes (see for example p4[). 
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The state space of the two-level coalescent is the set of marked partitions (tt, C('?r)), vr G 
n^, C(vr) G where 11 ^ is the set of partitions of a countable set I into subsets and for tt G , 
| 7 r| denotes the number of subsets in the partition. The marks {C(*) -1 = 1,..., | 7 r|} represent 
the positions in N of the subsets and |C| = \{k G N : = k for some i G |vr|}}|. For 

i = 1,..., ICI) let ni{7r) denote the number of subsets with ^ * so that ni{7r) = | 7 r|. 

A subset in the partition can jump to a new unoccupied site at rate c and level I coalescence 
of two subsets occurs at rate 71 if they are at the same site. On the other hand all the subsets 
at two occupied sites combine to form a single site with all these subsets at rate 72 . 

Therefore given (|C('^)|) (^i) • • • )^|C( 7 r)|) = {k, (ni,... ,nk)) the possible transitions are: 

1 . {k, {rii ,... ,nfc)) {k + 1, (ni,...,n* - 1 ,..., 1 )) at rate cniln.>i, 

2 . {k, (ni,... ,nk)) (k, (ni,... ,nj - 1 ,... ,nk)) at rate 7 ini(ni - 1 ), 

3. {k, (ni, ,nk)) {k - 1, (m ,... ,ni + rij,... ,nk)) at rate 'y 2 k{k - 1). 

Proposition 3.1 (The multilevel coalescent) 

(a) Consider the case with si = S 2 = c = 0, 72 > 0 and 71 > 0. Then the two level coalescent 
converges to ( 1 ,( 1 )). 

(b) If c > 0 and 71 > 0, 72 = 0, then the coalescent process started at {ko, (ni,... ■,nkff)) with 
ko < 00 , converges to {k, ( 1 ,... , 1 )) for some random k. 

Proof (a) Due to jumps of type 2 at rate 72 the process will eventually reach an element of 
the type (1, (ni ,... ,ni + nj,..., ni)). The due to jumps of type 1 the element (1, (1)) is then 
reached in a hnite random time. 

(b) If 72 = 0, the number of occupied sites is nondecreasing and at each site level I coalescence 
leads to a single element. 

□ 

4 Long-time behaviour of two type multilevel population sys¬ 
tems 

The class of two-level Fleming-Viot systems obtained by Theorem ^ with dual representation 
given by Theorem ^ describe a rich class of population systems. The evolution of their structure 
over different time scales depends on various parameters including mutation, levels I and II selec¬ 
tion, migration, and demographic stochasticity rates and lead to different classes of behaviours. 

In this section we consider the simplest case of a system with two types I = {1,2}, no 
mutation, migration rate c, levels I and II selection rates si,S 2 and levels I and II resampling 
rates 71 , 72 - We hrst consider the deterministic case (i.e. infinite population case at both levels, 
7 i = 72 = 0 ) and then in the random cases 71 and/or 72 > 0 . 

4.1 Nonlinear measure-valued dynamics with 71 = 72 = 0 

Proposition 4.1 Consider the two level system in which 72 = 0 and P(Ho = z^o) = 1, z^o £ 
V{V{T)). Then the V{V{T))-valued process Et = vt is deterministic. 
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Proof This follows immediately from the dual representation since the variance of Var(J’ h(//)St(/r)) 
0 for any /i € El. This can be shown to be zero since all i?[(/ /i(/r)Ht(/r))^] is obtained by starting 
the dual with 

h{ni) ( 2)2 h{n2) 

and since no coalescence occurs the two descendant sets evolve independently. □ 

Now assume that the mutation, migration and genetic drift parameters are zero and 

Pi(x) = 1d(x), Del, (4.74) 

and level II fitness function of the form: 

^2{t) = with Del. (4-75) 

(This can be generalized to V2{l) — with B e (I)^ for some K but we consider here 

the case K = 1 to keep things simple.) 

The transitions of the dual due to level I, respectively, level II, are given by 


Ic{xii) ^ lDnc{xii)+ Id<^{x\i) ®ilc{xi 2 ) at rate Si, (4.76) 


lc(a;ii) 1 b(xii) ( 2)1 lc(a^i2) + lB=(a:ii) (8)2 lc(a;2i) at rate S2. (4.77) 


Theorem 4 Assume that I = {1, 2} with fitness functions 


Pi(I) = 0, I/i(2) = l, 


(4.78) 


and 

V2{ti) = ti{l), (4.79) 

selection rates si > 0, S2>2 and all other parameters equal to zero. Then as t ^ oo, 

(a) If si > 0, S 2 = 0, and uq (5(i^o); i^en ut ^ {1 - z^o((l, 0)))(5(o,i), 

(b) If Si = 0, S 2 > 0, for some xq € (0,1], r'o({M • ^ ^jq}) > 0, then vt —>■ (5(p*^i_p*) with 

P* > J p.{l)no{dfi). 


Proof We compute ^E,^g{J //(l)t't(d/i)) using the equivalent dual form (77(1^0; I?t))- 

To compute the latter consider the first dual transition, either a level I selection jump ( [1.761) or 
a level II selection jump ( 4.77 ). This yields li —>■ li (2)i li at rate si and li —>■ li (2ii li + li (2)2 I2 
at rate S 2 and therefore 


^Di(/i(l)) = S2Var2,i(/r(l)) - si(Di(^(l)) - (^^(^(l)))^) (4.80) 

where 

= j Var2,i(^(l)) = J p.{lfnt{dti) - (J p.{l)i't{dp)f 
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(a) If S 2 = 0, then ([4.80|) yields the ode 




(4.81) 


and the result follows immediately. 


(b) If Si = 0, then by ( 4.8C1| ) Et{fi{l)) is nondecreasing and strictly increasing if Var 2 ,i(/u(I)) > 0. 
Since £'(^(1)) < 1 and Var 2 ,t(/i(l)) > 0 unless u = 5{^x,i-x) for some x G [0,1], limi_^(X) Var 2 ,t(/^( 1 )) 
0 and vt with p* > f fi(l)iyo(d/u). □ 

In the case Si = 0, S 2 = 0, the system is deterministic. In order for level II selection to play a 
role, diversity in the composition of clusters is required as first pointed out by Maynard Smith 
(recall discussion in subsubsection l.I.l). In particular consider the case where t'o £ 
let uo(dx) := ■ /^(l) € dx}) G P{[0, 1]). The following is a version of a result of Luo [^]. 


Theorem 5 Assume ( 4- ) and ( 4- 7 ^ ), c = 0, and si > 0. Assume thati/Q{dx) has a continuous 
density iiQ{x). Then 

(a) If there exists xq < 1 such that r'o([xo, 1]) = 0, then vt —^ '^( 0 , 1 )• 

(b) If the density Vq is continuous and positive at 1, then there exists a critical S 2 such that if 
S 2 < S 2 , then vt —>■ <5(0, 1 ) and if S 2 > S 2 , then there exists an equilibrium distribution with 
Veqiih : /.i(l) > 0}) > 0 . 


Proof This was established by S. Luo |5I] (see (A.5) in the case r/ = 17) by solving an integro 
partial differential equation for r'i(x), the density of the distribution of /xi(I). Luo’s equation is 
given by 


with solution 


^^v{t,x) =-^{v{t,x)x{l - x)) + \v{t,x)[x - j v{t,y)ydy] (4.82) 

= ^0 (y i + ^^U-l) ) + x{e^ - l)](^-2) (4.83) 

where A = h{t) = y^{t, y)dy. 

(a) For any e > 0 there exists to{e) such that for x > e and t > to{£), > xq and 

therefore the right side of ( 4.831) is 0. 

(b) In the case /tq is uniform on [0,1] and A 7 ^ 1, (|4.83 ) becomes 


where A = S 2 /S 1 . When A < 1, we have 


(A-2) 


f 


xv{t, dx) 0 . 


(4.84) 


(4.85) 


Veq{I) = lim v{t, 1) A - 1, 

t^oo 


When A > 1, we get 


(4.86) 
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and 


f 


xv{t, dx) 


A- 1 
A ' 


(4.87) 

□ 


From Theorem ^(a) it follows that the survival of type 1 depends only on the initial density 
U{0,x) near x = 1 and (|4.8(: ) remains positive if the initial density is positive at x = 1. This 
means that level II selection can overcome level I selection with two types only if there is a positive 
density of sites (subpopulations) at time t = 0 with arbitrarily small (or zero) proportions of 
the individually more fit types. We next consider the role of randomness and demonstrate that 
level one positive genetic drift can lead to a phase transition with the possibility of survival of 
inferior types for any initial uq provided that f ;u(l)zzo(d/x) S ( 0 , 1 ). 


4.2 Phase transitions: the role of randomness 

In this subsection we again consider the case in which level two selection favours colonies that 
include altruistic or cooperative types. In the previous subsection level II selection exploited the 
diversity in the initial distribution among demes. 


4.2.1 Level I randomness 

In this subsection we now consider the parameter regions for dominance of level I or level II 
selective effects and the transition between these phases where level II selection acts on the 
diversity in deme composition resulting from local genetic drift at each deme, that is, when 
7i > 0- 

Theorem 6 Consider the case I = {1, 2}. Assume that , c > 0, 71 > 0, 72 = 0, si > 0, S 2 > 0, 
with f fr(l)iyQ(dfJ,} G ( 0 , 1 ). 


Pi(l) = 0, Pi(2) = si. 

The migration rate is c and the deme fitness is 

V2{p) = S2^r{l). 

(a) Assume mi 2 = m 2 i = 0 and S 2 = 0, si > 0. Then fr(2)i'(d/a) —>■ 0 with exponential 
decay rate. 

(b) Assume mi 2 = m 2 i = 0 and si = 0, S 2 > 0. Then |u(l)z^(d^) 0 with exponential 

decay rate. 

(c) Assume mi 2 = m 2 i = 0. Then for fixed c > 0, 71 > 0, si > 0, there is a critical value 
^ 2 ( 0 , 7 , si) € (0,oo) such that level II selection dominates, that is then for e > 0, 


: /i(2) > e}) ^ 0, 
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if S 2 > ^2 level I selection dominates if S 2 < S 2 , that is, for e > 0, 

■ Ai(l) > £}) ^ 0. 

(d) Assume that mi 2 ,rn 2 i > 0. The there exists a unique equilibrium and the system converges 
to the equilibrium measure as t ^ 00 . 

Proof (a) and (b). We use the dual {Gt} with initial value Go = {2} ( or {1}) to compute the 
first moment of E^^ [/ fj,{2)Et{dfr)] = Eg^ [H(Eo,Gt)] where Hq = 6^^. Let p = po{2). 

We now identify two basic combinations of transitions that will either increase or decrease 
this expression. 

(1) Level I selection followed by migration before coalescence. 

Transitions in terms of indicator functions of the sets, 

12 ( 1 ) —> 12 ( 1 ) + li(l) 12 ( 1 ) level I selection at rate si 

12 ( 1 ) + li(l) ( 8)2 12 ( 2 ) migration 

so that after integration we obtain 

p —> p + (1 — p)p = 2p — = 1 — (1 — p)^. (4.88) 


Note that after the migration step, this change can no longer be reversed by a coalescence 
at site 1. Noting that migration before coalescence occurs with probability 2 c+-yi that this 
combination occurs with effective rate si 2 c+'yi • 

Similarly these transition result in 


li(l) ^ 0 + li(l) ( 8)1 li(l) level I selection 
—>■ li(l) ( 8)2 li( 2 ) migration 
( 1 -p) -^{l-pf 


In the absence of level II selection this can be identified with a Crump-Mode-Jagers (CMJ) 
branching process (see |^|(3.1.4), [^] ) in which individuals are occupied sites and during 
their lifetimes these individuals produce new offspring sites. Since the death rate is zero this 
is a supercritical branching process with exponential growth rate, the Malthusian parameter ai 
(see 1^] subsubsection 8.3.4) which can be represented in terms of the stable age distribution 
,(8.167). This proves (a). 


(2) Level II selection. We consider the effect of a level II selection event, followed by coalescence 
before migration. 

12 ( 1 ) — >■ Ii(I) <81 12 ( 1 ) + 12 ( 1 ) <82 12 ( 2 ) level II selection at rate S2 

^ Il(I) ®2 12(2) + 12(1) ®2 12(2) = 12(2) 

2 c 

migration before coalescence with probability - 

2 c + 7 i 

Ti 

—>■ 0 + 12 ( 1 ) <82 12 ( 2 ) coalescence before migration with probability - 

2 c + 7 i 
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so that after integration we have 


p ^ p, P^P^, (4.89) 

in the two cases. 

Similarly in the second case (coalescence before migration) we have 
li(l) —> li(l) ( 8)1 li(l) + 12 ( 1 ) < 8)2 li(2) level II selection 

Tl 

—> li(l) + 12 ( 1 ) li(2) coalescence before migration with probability -, 

2 c + 71 

so that after integration we have 

(1 - p) (1 - p) + p(I - p) = 1 - (4.90) 

and this occurs with effective rate S 2 2 ^^ ■ In the absence of level I selection this again produces 

a supercritical CMJ branching process with Malthusian parameter 02 in which individuals are 
occupied sites (where an occupied site is a site at which the set is not I). This proves (b). 

(c) When both si > 0, S 2 > 0 there is a competition between the two levels of selection. In the 
dual setting this corresponds to the the competition between two branching mechanisms, that 
is, competing CMJ branching processes. 

We now focus on the interaction of the competing CMJ processes. To do this we first consider 
effect of a level I transition at a site where a level II transition has already occurred. This leads 
to 

I 2 -^II I 2 ®2 I 2 -^I I 2 C 12 I 2 + I 2 ®2 ll ®2 I 2 ; (4.91) 

that is, in terms of sets 

{2} D ({2} 02 {2}) C ({2} 02 {2}) U ({2} 02 {1} 02 {2}) C {2} 
which after integration leads to 

p^77P^ ^7P^+p^(l-p) G [p,p^]. (4.92) 

This means that a level I transition after a level II transition partially reverses the first effect 
(but does not overshoot the reversal). (The same happens if a level II follows a level I transition.) 
Therefore we can obtain a bound to the decrease in the mean of type 2 due to level II transitions 
by completely reversing them at the rate of level I transitions. 

First, assume that > 827 ^^- We can construct a birth and death process with births 

(new factors li(-) via level I selection) at rate We note that we can obtain a domination 

by letting the action of level II selection to remove a li(') (i.e. replacing it by I 2 UI rather than 
something intermediate), that is this produces a death rate 527 ^^- Therefore the resulting birth 
and death process is supercritical and goes to infinity. Thus in this case, if f p{2)vQ{dp) > 0, 
then f ii,{2)ut{dpL) —)■ 1 and type 2 takes over in the limit in the McKean-Vlasov system. 
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Similarly, if 52^;^ > then we have a birth process with I2 factors produced due to 

level II selection at rate S 2 and a removal of these factors by level I selection at rate si . 
Therefore if f fi(l)i^Q(d/u) > 0, then f —>■ I and type 1 wins out and takes over in the 

McKean-Vlasov system. 

(d) This is a special case of Theorem 

□ 


Remark 6 We can also consider the linear stability of the fixed points (5(i^o)) '^(o.i) computing 

^^(£,i-£)(li(l))|t=o = S271 - sic 

for small e (e.g. under the condition S 2 < c discussed below). This is given by the first ehange 
due to a jump of the dual process started at li. This means that level II selection prevails if 


S2H > SlC. 


(4.93) 


Condition ( j.QS ) was derived by Kimura [jCj using the properties of the equilibria of the solution 
of equation lA for the density U{t,x). Existence and uniqueness of the solution to 0 was 
established by Shimakura namely, for any initial distribution there exists a unique solution. 

Also see Shiga |^/ for a generalization to the multitype case. A more complete deseription of the 
long-time behaviour is proved in Ogura and Shimakura Theorem 3(in) (in the ease S 2 < c 
so that si/71 < 1 if also S 2 > sic/71 which corresponds to the case in which the expected number 
of dual factors at a site is bounded above by 2). They also obtain a related result in Theorem 
5 in the case in which the mutation rates satisfy mi 2 > 0, m2i > 0 or mi2 > 0, m2i = 0 or 
mi2 = 0, m2i > 0. These results were obtained using ODE methods and explicit solutions in 
terms of hypergeometric functions. 

The ODE methods cannot be extended to populations having more than two types or systems 
with level II coalescence. The objective of this paper is to develop the dual representation and the 
above model of competing branching mechanisms to study the long-time behaviour of multitype- 


multilevel systems. An example involving three types is given in Theorem 1C. 


4.2.2 The case 71 > 0, c = 0 

Consider the case again with no mutation, level II fitness function V 2 {p) = Ai(2), S2 > 0, 
c = 72 = 0 but with uq = that is, no initial diversity in the composition of demes but with 
/io(*) > 0. z = 1, 2. Then level II selection can dominate, that is, /Ut(2) — 1 only if 71 > 0. This 
means that group selection cannot be effective if 71 = 0 in the case vq = . 

In the case c = 0, 72 = 0, 71 > 0, fixation occurs within each deme so that eventually the 
competition is between demes of type 1 and demes of type 2 and then /it(2) —>■ 1 if S2 > 0. To 
verify this we have 


(10) (01) (8)1 (10) -|- (10) (812 (10) at rate S 2 

—> (10) (812 (10) at rate 71 


kcond 


(4.94) 
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(10) ^ (10) + (01) 01 (10) at rate si (4.95) 

—> (10) at rate 71. 

Recalling that the coalescence dominates level I selection here due to the quadratic rate then 
effectively the dual develops as (lO)'^^^^*^ and k{t) —>■ 00. Then if //o(l) < I, I't = and 
^ 0 as t —7> 00. 

4.2.3 Level II randomness 72 > 0 

Proposition 4.2 (a) // 71 = 0, ^o(2) < 1- i^o-a-s., and si > 0, 72 > 0, c > 0, then 


E 


fj,{2)Et{dn) 


—7> 0 as t —>■ 00. 


(4.96) 


(b) If >0 and 72 > 0, then fixation occurs, that is, 


E ^jL{l)Et{d^jL) • j fi{2)Et{d^)^ 0, as t ^ 00 . 


Proof (a) This follows by a simple dual calculation as follows: 

(01) ^ (01) 01 (01) as rate si, 


(4.97) 


(4.98) 


(01) ^ (01) 01 (01) ^ (01) 02 (01) at rate c, (4.99) 

but then 

(01) 02 (01) ^ (01) 01 (01) at rate 72, (4.100) 

and 

(01) ^ (01) 01 (01) + (10) 02 (01) at rate S2 (4.101) 

—>■ (01) 01 (01) + (10) 01 (01) = (01) at rate 72 

Recalling that the coalescence dominates level II selection here due to the quadratic rate then 
effectively the dual develops as (01)®i^(*) and k{t) 00. This implies that level II selection has 
no long term effect and the level I selection leads to J (/i(2))^(*Vo((i/r) —>-0, as t ^ 00 . 

(b) This follows by a two-level version of the calculation in Example □ 
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5 Multitype multilevel population systems 

In this section we consider the evolution of more complex population systems with possibly 
many types, multilevel structure and different combinations of the basic mechanisms. In this 
case it is no longer possible to use the methods of one dimensional nonlinear dynamics or one 
dimensional diffusions, that is, methods involving ordinary differential equations. We will outline 
the application of the dual representations in some examples in this class. 


5.1 Equilibria and fixation probabilities 


An important application of the dual is to obtain results on ergodic properties of multilevel 
systems. We begin by reviewing the results for the case with only level I selection in |24]. 


Theorem 7 Ergodicity of McKean-Vlasov systems with level I selection. Assume that 72 = 0 
and that at time t = 0, all demes have the same individual distribution, say fio, that is uq = 5^^. 
If the mutation rates are positive on I and positive migration rate c > 0, then the limiting 
empirical process is given by a deterministic McKean-Vlasov dynamics {i^t}t>o where vt = 
Moreover, as t ^ 00 , ut converges to a unique equilibrium u^q. 

Proof See [^] Theorem 12, Theorem 14 for the case S2 = 0. The extension to include the case 
S2 > 0 follows along the same lines. □ 

We now consider the extension of the ergodicity result to the case in which 72 > 0. 


Theorem 8 Ergodicity of two level Fleming-Viot systems. Assume that 72 > 0 and S 2 > 0 and 
positive mutation rates on I. Then the law of the two level Fleming-Viot process converges to a 
unique equilibrium P^q G 


Proof We adapt the proof of |Q. Due to level II coalescence the number of sites occupied 
by the dual can be reduced to 1 and this event is recurrent. Note that the equilibrium mean 
measure of a subset of I is given by the probability that the dual process starting at the indicator 
of the subset hits the absorbing point but this occurs with positive probability at each time 
the event occurs. Therefore an absorbing point is reached with probability 1. □ 


We now consider the case in which the mutation Markov chain has two or more ergodic 
classes. In this case the system can be non-ergodic. If 71 > 0, then eventually the population 
will be concentrated on one of the ergodic classes and the problem is to compute the fixation 
probabilities. 


Theorem 9 Fixation probabilities 

(a) Single deme. Assume that the initial configuration is iid fiQ and that the mutation chain 
has two or more ergodic classes 71 > 0, and si > 0. Then the population is 

ultimately fixed on one of the classes and the probability that it is in class k is given by the 
equilibrium measure of the dual chain started with Qq = Ik integrated with respect to the 
initial measure /io • 
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(b) Two level Fleming-Viot systems. Assume that 71 > 0 , 72 > 0 , Vi, V2 > 0 and that the 
mutation Markov chain has two or more ergodic classes. Then there is ultimate fixation 
of a single ergodic class and the law of the two level system converges to a random mixture 
of pure equilibrium single class populations. 

Proof (a) We again use the dual representation. To verify ultimate fixation take Go = 

, 2 < < L Then due to the quadratic rate coalescence, with probability 1, t/,- = 0 at some 

random time r and this is a trap so that Gt = ^ for large t. To compute the probability that 
ergodic class k is chosen, start the dual with Go = Ifc- The dual process jumps are within class 
mutation jumps, selection jumps and coalescence jumps. Due to coalescence the state ^ is 
recurrent and therefore the dual is positive recurrent and converges to equilibrium as t —>■ 00. 
The fixation probability (limiting probability that ergodic class k is chosen) is then obtained by 
integrating with respect to the initial measure /U®. 

(b) We again use the dual process. Due to level II coalescence there will eventually be an 
equilibrium distribution of the number of sites occupied by the dual and in fact the single site 
situation occurs infinitely often. Then each time there is non-zero probability that one of the 
classes will be eliminated and the within class mutation will hit an absorbing point as in (a). 

□ 


Example 8 Assume that I = {1,2, 3}, no mutation, 71 > 0, 72 > 0 and uo = • 

We use the set-valued dual Gt- UOo = {j}2, then by level II coalescence followed by level 

I coalescence, Gt = ^ at some finite random time r if i j and therefore only one type survives. 
We also note that the dual process Gt starting from Go = is positive recurrent with {z}i as 
a renewal point - this is due to the levels I and II quadratic rate coalescence events (in contrast 
to the linear birth rates due to selection events). 

Then “level II fixation” occurs, namely. 


vt —^ with probability pi 

and the fixation probabilities pi are obtained by integrating the equilibrium dual Gecfif) starting 
from Go = {i}i, that is. 

Pi = V®{Gec^{i))- 

5.2 Examples of multilevel effects 

The class of multitype multilevel population systems with mutation and selection is extremely 
rich and can exhibit many complex dynamical behaviors. We do not discuss this in detail but 
now given an illustrative example of a simple effect of this type 

5.2.1 A model of cooperation. 


Consider the 3 type case I = (1,2, 3} with level I fitness function 

Fi(l) = ui, I/i(2) = Di(3) = 0, 
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and level II fitness function 

V'2(/i) = U2^(2)/x(3), 

with vi,V 2 > 0. This models a cooperative interaction of two types 2 and 3 that endow a deme 
containing them with a positive advantage. 

We consider the emergence of demes having population distributions (pi,P 2 ,P 3 } when 72 = 0 
satisfying 

VlPl < V2P2P3- (5.102) 


Theorem 10 Assume that c > 0, 71 > 0, rare mutation from type 1 to types 2,3 occurs at rates 
mi 2 = mi 3 = £ > 0 and m23,777,32 > 0, 777,21 = w-31 = 0. 

(a) If V 2 = 0, then for sufficiently large vi > 0 we can have long time survival of type 1, that 
is an equilibrium with positive mean proportion of type 1. 

(b) There exists a critical value V 2 at which emergence of demes having threshold values 
satisfying (\5. 1 Oi) occurs for V 2 > V 2 , type 1 becomes extinct and types 2, 3 go to equilibrium. 


Proof (a) This has been proved for the case of two types in Corollary 10.1. The proof of 
(a) follows along the same lines and elements of it are used in the proof of (b). We will outline 
the main steps. We use the dual started with Qq = {1} (8> The corresponding indicator 
function representation starts with (100) (8) (111)®^. Then selection followed by migration leads 
to new summands. Mutation has the effect of eliminating summands so that together we have 
a birth and death process. There are two possible outcomes. First the (100) can be a recurrent 
point and eventually will eventually change to (000) with probability 1. The second possibility 
is that the birth and death process is supercritical and in this case the type 1 is positive for the 
invariant measure. 

(b) We consider limi_j.oo f p(2U3)nt(d/i) and limt^oo f Note that (using indicator 

functions of subsets of I) the level II selection transitions gives 


(Oil) ^ 

(010)01 (001)01(Oil) 

+ [(100) 01 (111) + (010) 01 (110) + (001) 01 (111)] 02 (Oil) 


In order to produce a permanent summand we then require either two migration or mutation 
transitions before coalescence of the first two terms but this occurs with positive rate so that 
this can have an important effect for large V 2 . This can then lead to 


(Oil) ^ 

(010)01(001)01(Oil) 

+ [(100) 01 (111) + (010) 01 (110) + (001) 01 (111)] 02 (Oil) 
... ^ (Oil) + (100) 02 (Oil) 

Level I selection leads to 


(Oil) ^ (Oil) 01 (Oil) 
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If we then have a level II selection 


(Oil) ^ (Oil)(Oil) 

^ (010) (8)1 (001) (Oil) (Oil) 

+ [(100) (111) + (010) 8)1 (110) + (001) 8)1 (111)] 82 (Oil) 81 (Oil) 

^ . (Oil)81(Oil) + (Oil) 81 (100)82 (Oil) 


We then have competing branching mechanisms one whose rate is proportional to ui and one 
whose rate is proportional to V 2 - We can construct birth and death processes where the deaths 
in the level I process (birth rate prop, to ui) are caused by mutation and level II selection. 
Deaths in the level II process (birth rate prop to U2)are caused by level I selection. Recall that 
type 1 can only survive if the level birth and death process is supercritical (cf. DG, p. 743). We 
again have a dichotomy involving the critical behaviour of two competing branching processes 
and the result follows as in the proof of Theorem ^ □ 


5.2.2 Emergence of mutualistic types 


There are many different mechanisms involving multilevel selection that can influence the overall 
population structure. For example, multilevel selection can make possible the survival of a trait 
that then leads to the emergence of another trait that has a mutually beneficial effect which 
then gives the first trait higher (inclusive) individual fitness and the pair survives locally. 

Let 


To illustrate this consider the following modification of the model of subsubsection 5.2.1 


Vi(l) = ui, Ti(2,/r) = UM •/u(3), Fi(3,p) = um •+(2), 
and level II fitness function 

V2(^) = U2(/r(2) +/x(3)). 


Theorem 11 Assume that c > 0, 71 > 0, rare mutation from type 1 to types 2,3 occurs at rates 
mi 2 = mi 3 = £ > 0 very small, and with vi,V 2 ,vm > 0. Denote by Xi(t) the expected proportion 
of type i at time t. 

(a) Consider the deterministic case with 71 = mi 2 = m,i 3 = 0, = 1 and V 2 = 0 and for 

simplicity assume that ^ 2 ( 0 ) = ^ 3 ( 0 ) > 0. Then if 


VM < 


2 

l-xi(O)’ 


then 

xi{t) —>■ 1 


as t —)• 00 . 

(b) If 


2 


VM > 


1 - xi(0)’ 
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then xi{t) -i- 0 as t -i- oo. In the case vm = there is an unstable equilibrium. 

(c) Consider the two level system. Assume conditions on uq, namely on the density at 1 of the 
total mass of types 2 and 3 (as in Theorem ^ and that vm > 2. Then for sufficiently large V 2 
type 1 becomes extinct and the mutualistic types dominate. Moreover they continue to dominate 
even if the level II seleetion ends, that is, V 2 {t) = l\i^ ^{t)v 2 for suffieiently large T. 

In this case we have a two stage proeess 

Stage 1: Ignoring the mutualistie effect, survival and equilibria of altruistie types 2,3 by level II 
selection. 

Stage 2: Takeover by the mutualistie pair 2,3. 


Proof (a,b) Using the dual as above we can obtain the following equations by decomposing 
the first transitions of the dual into the different cases (recall ( |3.47 )): 

LM = ^,(0)1(1 - x,(o)) - ^{1 - 


The result follows by checking the sign of the derivative. 

(c) By Theorem in the two level system, under appropriate conditions on the density of the 
total mass of types 2 and 3 at 1, for sufficiently large V 2 there is an equilibrium with mean mass 
of types 2,3 greater than that is 1 — xi(0) > and therefore by part (b) type 1 becomes 
extinct and the mutualistic types dominate. Moreover they continue to dominate even if the 
level II selection ends. 

□ 


Remark 7 Since the mutualistic pair can then persist even in the absence of further level II se¬ 
lection in (c), this illustrates the possible role of transient group selection in the long time genetie 
structure of a population. Without the role of level II selection, the simultaneous emergence of 
both types with at least critical density would he an event of higher order rarity. 

Remark 8 We can also consider a random version of this. In this case we assume that vq = 
'^(1,0,0); 7i > 0; ^12 = 'niiz = e > 0. Then again for sufficiently large vm o,nd V 2 type 1 dies 
out and the mutualistic types take over. The dual analysis involves four classes of selection 
birth events corresponding to the level I fitness of type 1, the mutualist fitnesses of types 2,3, the 
level II fitness of sites containing types 2 and 3 and level I coalescence. This will be carried out 
elsewhere. 

5.2.3 A three level system 

Consider a system with state space ^(^(^(II))). For example, this could model a system of com¬ 
peting regions in which regions contain competing towns and each town contains a population 
with type space I. The relative fitness V3(z^), of a region (level III fitness) is assumed to depend 
on the distribution n of the characteristics of the towns it contains so that V3 : {V{V{I))) —>■ [0,1] 

K 

o = n 

k=l 


j hk{pik)v{dpLk) 


(5.103) 
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where hj. is of the form ( 2.2C ), and convex combinations of functions of this form. We also allow 
individuals to migrate to a different region and even entire towns to move to a different region. 
Then the combined effect of three levels of selection can lead to complex behaviour which can 
be analyzed using the set-valued dual with values in 


5.3 The study of more complex multilevel interactions - set-valued Monte 
Carlo approximation 

For more complex multilevel interactions it is natural to consider simulations since closed form 
solutions cannot be expected. However the numerical simulation of systems of Fleming-Viot 
processes involves the solutions of systems of nonlinear stochastic partial differential equations 
with degenerate boundary behaviour. The numerical simulation of such systems of stochastic 
differential equations is difficult. On the other hand the dual formulation involves only the 
simulation of continuous time Markov chains with discrete states which are easy to simulate by 
Monte Carlo methods. In particular one can compute means and covariances directly by simu¬ 
lating the dual process and determining the empirical distribution of the outcomes, namely, of 
the absorption probabilities (for equilibria) or equilibrium probabilities for fixation probabilities. 

5.4 Extensions 

The study of multilevel evolutionary systems arises in evolutionary biology, ecology, virology, 
sociology, economics, etc. These give rise to a wide range of mechanisms and interactions. 
Some of these can be modeled by extensions of the models and methods described above. For 
example, the models and the set-valued dual representations can be extended to systems with 
countably many types, recombination and horizontal gene transfer, higher level interactions, 
random environments, multiple species, and measure-valued processes with jumps leading to 
duals with multiple coalescence. 
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